Dynamic Stream Weighting for Turbo-Decoding-Based Audiovisual ASR
Abstract
Automatic speech recognition (ASR) enables very intuitive human-machine interaction. However, signal degradations due to reverberation or noise reduce the accuracy of audio-based recognition. The introduction of a second signal stream that is not affected by degradations in the audio domain (e.g., a video stream) increases the robustness of ASR against degradations in the original domain. Here, depending on the signal quality of audio and video at each point in time, a dynamic weighting of both streams can optimize the recognition performance. In this work, we introduce a strategy for estimating optimal weights for the audio and video streams in turbo-decoding-based ASR using a discriminative cost function. The results show that turbo decoding with this maximally discriminative dynamic weighting of information yields higher recognition accuracy than turbo-decoding-based recognition with fixed stream weights or optimally dynamically weighted audiovisual decoding using coupled hidden Markov models.
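The abstract does not state how the stream weights enter the decoder. As a minimal sketch, assuming the log-linear combination that is widely used in audiovisual ASR (the fused log-likelihood is a convex combination of the audio and video log-likelihoods, with a per-frame weight), dynamic stream weighting might look like the following. The function name `fuse_log_likelihoods` and its argument names are hypothetical, and the sketch covers neither the discriminative weight estimation nor the turbo decoding itself.

```python
import numpy as np


def fuse_log_likelihoods(log_b_audio, log_b_video, lam):
    """Fuse per-frame, per-state log-likelihoods with a dynamic stream weight.

    Assumed (not taken from the paper): log-linear fusion
        log b(t, s) = lam[t] * log b_audio(t, s) + (1 - lam[t]) * log b_video(t, s)

    log_b_audio, log_b_video: arrays of shape (T, S), T frames and S states.
    lam: array of shape (T,), audio weight per frame; clipped to [0, 1].
    """
    log_b_audio = np.asarray(log_b_audio, dtype=float)
    log_b_video = np.asarray(log_b_video, dtype=float)
    # Add a trailing axis so each frame's weight broadcasts over all states.
    lam = np.clip(np.asarray(lam, dtype=float), 0.0, 1.0)[:, None]
    return lam * log_b_audio + (1.0 - lam) * log_b_video


# Toy usage: three frames, two states.
audio = [[-1.0, -2.0], [-3.0, -4.0], [-2.0, -6.0]]
video = [[-5.0, -6.0], [-7.0, -8.0], [-4.0, -2.0]]
weights = [1.0, 0.0, 0.5]  # audio only, video only, equal trust
fused = fuse_log_likelihoods(audio, video, weights)
```

A fixed-weight system corresponds to passing the same value for every frame. A dynamic scheme varies `lam` over time, for example lowering it in frames where the audio is noisy.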
Similar resources
Turbo Decoders for Audio-Visual Continuous Speech Recognition
Visual speech, i.e., video recordings of speakers’ mouths, plays an important role in improving the robustness properties of automatic speech recognition (ASR) against noise. Optimal fusion of audio and video modalities is still one of the major challenges that attracts significant interest in the realm of audiovisual ASR. Recently, turbo decoders (TDs) have been successful in addressing the au...
A Turbo-Decoding Weighted Forward-Backward Algorithm for Multimodal Speech Recognition
Since the performance of automatic speech recognition (ASR) still degrades under adverse acoustic conditions, recognition robustness can be improved by incorporating further modalities. The arising question of information fusion shows interesting parallels to problems in digital communications, where the turbo principle revolutionized reliable communication. In this paper, we examine whether th...
Novel decoding algorithm with weighting of extrinsic information for punctured turbo-codes
Turbo codes are known as powerful error-correcting codes. The structure of a turbo code, a concatenation of convolutional codes, makes it possible to control the code rate through puncturing. When puncturing is applied, the reliability of the decoded bits is not uniform over a frame, because some bits are decoded using only received information bits, and the others are decode...
Fast Correlation Attacks Based on Turbo Code Techniques
This paper describes new methods for fast correlation attacks on stream ciphers, based on techniques used for constructing and decoding the by now famous turbo codes. The proposed algorithm consists of two parts, a preprocessing part and a decoding part. The preprocessing part identifies several parallel convolutional codes, embedded in the code generated by the LFSR, all sharing the same inform...
Combined Turbo Block Decoding and Equalisation
In this paper, the combination of equalization and turbo decoding is studied. In the iterative decoding of a product code in a block turbo coding system, the equalization process is performed within the iteration loop. The present study aims to investigate the decision feedback equalizer (DFE) incorporated in the iterative decoding. Simulation results show that the more severe the channel interfe...